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Description 

FIELD OF THE INVENTION 

5 [0001] The present invention relates to the preparation of multivalent and murtispecific binding proteins. In particu- 
lar, the invention relates to the preparation of antigen binding proteins comprising a plurality of binding units linked in 
series by means of intervening polypeptide linker groups, the amino acid sequence of which linker groups confer 
restricted conformational flexibility. 

10 BACKGROUND OF THE INVENTION 

[0002] There is considerable interest in the preparation of multivalent and/or murtispecific antigen binding proteins. 
Antigen binding proteins which are multivalent (that is, comprise more than one antigen binding site), more especially 
those which are also murtispecific (where the antigen binding sites have differing antigen specificities) have found par- 

15 ticular application in the fields of diagnosis or therapy, for example, where the construction of binding proteins having 
binding activity .against both target site and diagnostic or therapeutic agent allows for targeted delivery of the diagnostic 
or therapeutic agent to the intended site of action. Other uses for which multivalent and multispecif ic binding proteins 
have been proposed include assays, such as immunoassays and agglutination assays, and purification processes. 
[0003] Those multivalent, murtispecrfic antigen binding proteins which have been described in the literature to date 

20 rely, in general, on the association of antibody light and heavy chain variable domains for the formation of the antigen 
binding site. 

[0004] Thus, constructs comprising two or more polypeptide chains are described in WO 94/09131 (Scotgen Lim- 
ited) and WO 97/14719 (Unilever) and WO 97/38102 (Unilever); multivalent molecules comprising two or more single 
chain Fv molecules linked together are described in WQ93/1 1 1 61 (Enzon Inc.) and WO 94/13806 (Dow Chemical Co.). 

25 [0005] WO 94/04678 (Casterman et al) describes immunoglobulins capable of exhibiting the functional properties 
of classical, four chain, immunoglobulins but which comprise two heavy polypeptide chains only and are naturally 
devoid of light polypeptide chains. Fragments corresponding to isolated Vh domains or to Vh dimers linked by the hinge 
disulphide are also disclosed. These immunoglobulins, which may be isolated from Camelids, do not rely on the asso- 
ciation of heavy and light chain variable domains for the formation of the antigen-binding site; instead, the heavy chain 

30 variable domain (hereinafter Vhh) alone forms the complete antigen binding site, constituting a single domain binding 
site. 

[0006] In their later patent application, WO 96/341 03, Casterman et ai disclose multivalent, multispecif ic constructs 
comprising Vhh fragments combined with a linker sequence. Suitable linker sequences disclosed and exemplified are 
derived from sequences corresponding to the hinge domain of an immunoglobulin devoid of light chains. 

35 [0007] In the Applicant's co-pending patent application number PCT/EP98/06991 , filed 26th October 1998, there 
are disclosed multivalent, multispecif ic antigen-binding proteins comprising a polypeptide comprising in series two or 
more single domain binding units which are preferably variable domains of a heavy chain derived from an immunoglob- 
ulin naturally devoid of light chains. The individual single domain binding units may suitably be linked by means of pep* 
tide linkers, preferably flexible peptide linkers, which allow the variable domains to flex in relation to each other with the 

40 aim of ensuring that they can bind to multiple antigenic determinants simultaneously. 

[0008] There remains a continuing need for the development of improved methods for producing multivalent and/or 
multispecif ic binding proteins, especially antigen binding proteins. In particular, there is commercial interest in produc- 
ing molecules which not only have improved binding activity but which also can be produced economically on a large 
scale. 

45 

SUMMARY OF THE INVENTION 

[0009] In a first aspect, the invention provides the use of a polypeptide group, the amino acid sequence of which 
group confers restricted conformational flexibility, as a linking group to link binding units in a multivalent binding protein. 
so [0010] "me invention also provides a multivalent binding protein comprising a plurality of binding units linked by 
means of intervening polypeptide linker groups, the amino acid sequence of which linker group confers restricted con- 
formational flexibility. 

[001 1 ] The invention further provides a nucleotide sequence encoding a multivalent antigen binding protein accord- 
ing to the invention and cloning and expression vectors comprising such nucleotide sequences. Also provided are host 

55 cells transformed with vectors comprising such nucleotide sequences. 

[001 2] As used herein, a 'multivalent binding protein* is a protein which has more than one binding units which allow 

— for specific binding with a molecule partner in a binding pair. Included within this are bivalent, trivalent and so on. Exam- 
ples of suitable binding units include antigen binding domains of antibodies, binding domains of receptors such as hor- 
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mone receptors, lectins, enzymes, and cell adhesion molecules. A 'multivalent antigen binding protein' is a protein 
which has more than one antigen binding unit. 

[001 3] An 'antigen binding unit' is any structure which exhibits antigen-binding activity. This may be an antibody or 
an immunologically active fragment thereof. An 'antibody 1 refers to an immunoglobulin which may be derived from nat- 
ural sources or synthetically produced. Unless indicated otherwise, 'antibody* and immunoglobulin' are used synony- 
mously throughout this specification. 

[0014] An antibody fragment is a portion of a whole antibody which retains the ability to exhibit antigen-binding 

activity. The antigen binding site may be formed through association of antibody light and heavy chain variable domains 

or may comprise individual antibody variable domains, constituting a single domain binding site. 

[001 5] Suitable fragments include Fab (comprising an antibody light chain associated with the V H and C H1 domains 

of an antibody heavy chain), Fv (comprising the variable domains of antibody heavy and light chains associated with 

each other) and scFv (comprising an antibody V H domain linked to a V H domain by a flexible peptide linker) fragments. 

Where the antigen binding site comprises a single variable domain, this may be a heavy chain variable domain, most 

suitably a heavy chain variable domain derived from an immunoglobulin naturally devoid of light chains. 

[001 6] 'Restricted conformational f lexibil rty' relates to restriction of movement of the antigen binding units about the 

backbone of the intervening polypeptide linker group. 

[0017] The present invention may be more fully understood with reference to the following description, when read 
together with the accompanying drawings. For convenience, an antigen binding protein comprising two single binding 
units is hereinafter referred to as a t>i-head\ 

BRIEF DESCRIPTION OF THE DRAWINGS 



[0018] 

25 Figure 1 



shows a nucleotide sequence of the Pstl-BstEII insert of plasmid p(JR4640, encoding the heavy chain var- 
iable domain of an anti-RR6 antibody (denoted R9) from a llama. 



30 



35 



40 



Figure 2 shows the nucleotide sequence of the Pstl-BstEII insert of plasmid pUR4601 , encoding the heavy chain 
variable domain of an anti-hCG antibody (denoted H14) from a llama. 

Figure 3 shows a map of plasmid pUR461 9. 

Figure 4 shows the nucleotide sequence within plasmid pUR4619 which encodes an anti-hCG-anti-RR6 bispecific 
biheaded antigen-binding protein (denoted HI4-R9), missing the first 4 and last 3 amino acids. 

Figure 5 shows the A405 signals of an ELISA to determine bispecrficrty of various HI4-R9 biheads. 

Figure 6 shows the scores achieved in a rapid assay technology (RAT) format assay following the detection of 1 
lU/ml hCG (human chorionic gonadotropin protein) with various anti-hCG-anti-RR6 bihead antigen bind- 
ing proteins derived from a llama wherein the anti-hCG and anti-RR6 fragments are linked as follows (see 
Example 1.5): 



45 



50 



55 



Number 


Linker 


1 


no linker (directly attached) 




2 


G-T-S-G-S 


(SEQ. ID NO. 1) 


3 


S-S-S-A-S-A-S-S-A 


(SEQ. ID NO. 2) 


4 


G-S-P-G-S-P-G 


(SEQ. ID NO. 3) 


5 


A-T-T-T-G-S-S-P-G-P-T 


(SEQ. ID NO. 4) 


6 


A-N-H-S-G-N-A-S 


(SEQ. ID NO. 5) 



Figure 7 shows a comparison of the sensitivity of detection of hCG in a RAT assay using various biheads (see 
Example 1.5). 
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DETAILED DESCRIPTION OF THE INVENTION 

[0019] The invention is based on the unexpected finding that by using a polypeptide linking group conferring 
restricted conformational flexibility to link together antigen binding units, multivalent antigen binding proteins having 
s advantageous binding affinity, as demonstrated by their increased sensitivity of diagnosis and detection, are obtained. 
Furthermore, constructs according to the invention may conveniently be produced at high yields economically and effi- 
ciently on a scale appropriate for industrial use. 

[0020] As is apparent from the discussion of the background to the invention above, to the extent that multivalent 
antigen binding constructs comprising separate binding units linked together have been described at all in the literature, 
10 the linking means has been provided by flexible peptide groups. Flexibility of conformation in the linker group has been 
considered desirable in order to allow the multivalent construct to assume the correct orientation to allow simultaneous 
binding of multiple antigens. 

[0021] Surprisingly, the present inventors have found that by restricting the conformational flexibility of the linking 
polypeptide group, multivalent antigen binding constructs having improved binding affinity may be obtained. This is 

75 entirely contrary to the teaching in the art that the linking group should desirably be flexible. 

[0022] The invention is applicable to the preparation of multivalent antigen binding constructs comprising antigen 
binding units where the antigen binding site is formed through association of antibody light and heavy chain variable 
domains. Preferably, however, the constructs prepared according to the invention comprise a plurality of single domain 
binding units, more particularly a plurality of heavy chain variable domains derived from an immunoglobulin naturally 

20 devoid of light chains such as may be obtained from lymphoid cells, especially peripheral blood lymphocytes, bone mar- 
row cells or spleen cells derived from Camelids as described in WO 94/04678 (Casterman et al) discussed above. An 
advantage of using single domain binding units which are heavy chain variable domains derived from Camelids is that 
they can readily and conveniently be produced economically on a large scale, for example, using a transformed lower 
eukaryotic host, as described in WO 94/25591 (Unilever), described above. 

25 [0023] Bivalent forms, that is having two antigen binding sites, of the multivalent antigen binding proteins prepared 
according to the invention are preferred but it will be appreciated that higher multivalent forms, which are also encom- 
passed in the present invention, may find application under suitable circumstances, for example where more than two 
antigens are required to bind, for example in processes for scavenging molecules from solution or processes where 
close proximity of molecules form the basis of an assay. 

30 [0024] Structural features which may suitably be incorporated into the linking polypeptide group in order to achieve 
the effect of restricting conformational flexibility according to the purposes of the invention would readily suggest them- 
selves to those skilled in the art. 

[0025] Accordingly, in one embodiment, the linker group preferably comprises one or more proline residues. 
[0026] Without wishing to be bound by theory, it is generally thought that the presence of a proline residue in a pep- 

35 tide sequence encourages the amino acid backbone of the peptide to adopt a beta-turn structural configuration, with 
the peptide backbone changing direction about the proline residue. Linker groups comprising other sequence features 
which promote the formation of a beta-turn configuration in the peptide backbone, such as peptide linkers containing 
valine residues or constrained residues such as 8-bicyclic and 5,9-bicydic tripeptide units (see, for example, Johannes- 
son et al, J. Med. Chem., 42, 601-608 (1999), may also suitably find application in the present invention. 

40 [0027] In another embodiment, peptide linker groups derived from naturally occurring proteins such as cell wall pro- 
teins (CWP), in particular, CWP1, or cellobishydrolases (CBH), such as CBH1P, which serve to restrict conformational 
flexibility or linker groups showing at least 50% homology thereto as determined by the ALIGN program of Dayhoff et al 
(1983), Methods Enzymol., 21. 524-545, may also suitably be used according to the invention. 
[0028] Peptide linker groups which encode a glycosylation binding site and/or are resistant to proteolytic attack may 

45 also suitably be employed. Here, the presence of a carbohydrate attached to the amino acid residues has the effect of 
restricting the flexibility of the peptide backbone. 

[0029] Conveniently, the polypeptide linking group according to the invention comprises from 4 to 30 amino acid 
residues, preferably from 5 to 15 amino acid residues. 

[0030] Preferred polypeptide linking groups according to the invention comprise an amino acid sequence selected 
50 from: 

S-S-S-A-S-A-S-S-A, (SEQ. ID NO. 2> 
G-S-P-G-S-P-G, (SEQ. ID NO. 3> 

A-T-T-T-G-S-S-P-G-P-T (SEQ. ID NO. 4) 

55 

[0031 ] It will be appreciated that although the invention has been described primarily by reference to antigen bind- 
ing proteins, it is equally applicable to proteins comprising other binding units as described above. References to anti- 
gen binding proteins will accordingly be understood to refer also to such other proteins unless the context dictates 
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otherwise. 

[0032] Multivalent antigen binding proteins according to the invention may be prepared by transforming a host by 
incorporating a gene encoding the polypeptide as set forth above and expressing said gene in said host. 
[0033] Suitably the host or hosts may be selected from prokaryotic bacteria, such as Gram-negative bacteria, for 
5 example E co//, and Gram-positive bacteria, for example B. subtilis or lactic acid bacteria, lower eukaryotes such as 
yeasts, for example belonging to the genera Saccharomyces, Kluyveromyces, Hansenuta or Pichia, or moulds such as 
those belonging to the genera Aspergillus or Trichoderma. 

[0034] Preferred hosts for use in connection with the present invention are the lower eukaryotic moulds and yeasts. 
[0035] Techniques for synthesising genes, incorporating them into hosts and expressing genes in hosts are well 
10 known in the art and the skilled person would readily be able to put the invention into effect using common general 
knowledge. 

[0036] Methods for producing antibody fragments or f unctionalised fragments thereof derived from the heavy chain 
immunoglobulin of Camelidae using a transformed lower eukaryotic host are described, for example in patent applica- 
tion WO 94/25591 and such techniques may suitably be applied to prepare constructs according to the present inven- 
ts tion. 

[0037] Proteins according to the invention may be recovered and purified using conventional techniques such as 
affinity chromatography, ion exchange chromatography or gel filtration chromatography. 

[0038] The activity of the multivalent binding proteins according to the invention may conveniently be measured by 
standard techniques known in the art such as enzyme-linked immunoadsorbant assay (ELISA), radioimmune assay 
20 (RIA) or by using biosensors. 

[0039] The following examples are produced by way of illustration only. 

[0040] Techniques used for the manipulation and analysis of nucleic acid materials were performed as described in 
Sambrook et al, Molecular Cloning, Cold Spring Harbor Press. New York, 2nd Ed., (1989) unless otherwise indicated. 
[0041] Restriction sites are underlined. 
25 HC-V denotes heavy chain variable domain. 

EXAMPLES 

Example- 1 Self Assembling 1 Llama Bi-heads Containing Linker Peptides on Latex to Assay hCG 

30 

1.1 Construction of Llama Bi-heads with Various Linkers 
a) Induction of humeral immune responses in llama 

35 [0042] Male llamas were immunised with a water in oil emulsion (1 :9 V/V, antigen in water: Specol (Bokhout et al, 
Vet. Immunol. Immunopath., 2:, 491-500 (1981)) subcutaneously and intramuscularly. Per immunisation site 0.75-1.5 
ml water in oil.emulsion was inoculated containing 100:g antigen. The antigens used were: hCG (Sigma), azo-dye RR6 
(ICQ which was coupled to BSA via its reactive triazine group. Immunisations were performed according to the following 
time table: The second immunisation was performed three weeks after the f irst The third was performed two weeks 

40 after the second immunisation. The immune response was followed by antigen specific ELISAs. 

[0043] The anti-RR-6 response was measured by using Nunc Covalink plates, which where coated with the azo- 
dye. After incubation with (diluted) serum samples, the bound llama antibodies were detected via a incubation with poly- 
clonal rabbit-anti-llama antiserum (obtained via immunising rabbits with llama immunoglobulins which were purified via 
ProtA and ProtG columns; ID-DLO), followed by an incubation with swine-anti-rabbit immunoglobulins (Dako) conju- 

45 gated with alkaline phosphatase. Finally the alkaline phosphatase enzyme-activity was determined after incubation with 
p-nitro-phenyl phosphate and the optical density was measured at 405nm. The anti-hCG response, was measured in 
essentially the same way using Nunc maxi-sorb plates coated with hCG. 



50 



b) Cloning, expressing and screening of llama HC-V fragments 
i) Isolation of gene fragments encoding llama HC-V domains 



[0044] From an immunised llama a blood sample of about 200m! was taken and an enriched lymphocyte population 
was obtained via Ficoil (Pharmacia) discontinuous gradient centrifugation. From these cells, total RNA was isolated by 
55 acid guanidium thiocyanate extraction (e.g. via the method described by Chomczynnski and Sacchi, Analytical Bio- 
chemistry, 162: 156-159 (1987). After first strand cDNA synthesis (e.g. with the Amersham first strand cDNA kit), DNA 
fragments encoding HC-V fragments and part of the long or short hinge region were amplified by PCR using specific 
primers: 
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PstI 

2B 5 ' - AGGT SMARCTGCAGS AGTCWGG- 3 ' 



(SEQ. ID NO. 6) 



5 



S = Cand G,M = AandC t R = AandG, W = AandT, 



10 




15 




20 



25 [0045] Upon digestion of the PCR fragments with Pst\ (coinciding with codon 4 and 5 of the HG-V domain, encod- 
ing the amino acids L-Q) and SsfEII (located at the 3'-end of the HC-V gene fragments, coinciding with the amino acid 
sequence Q-V-T), the DNA fragments with a length between 300 and 400bp (encoding the HC-V domain, but lacking 
the first three and the last three codons) were purified via gel electrophoresis and isolation from the agarose gel. 

30 ii) Construction of Saccharomyces cerevisiae expression plasmids encoding llama HC-V domains 

[0046] Plasmids pUR4547 and pUR4548 are Saccharomyces cerevisiae episomal expression plasmids, derived 
from pSY1 (Harmsen et al., Gene, 125: 1 15-123, (1993). From pSY1 the Pst\ site, located in front of the GAL7 promoter 
was removed after partial digestion with Pst\ t incubation with Wenow fragment and subsequent blunt end ligation. After 
35 transformation the desired plasmid could be selected on the basis of restriction pattern analysis. Subsequently, the 
BsfEII site in the Leu2 selection marker was removed by replacing the about 410bp AflWPMl fragment with a corre- 
sponding fragment in which the SsfEII site was removed via a three step PCR mutagenesis, using the primers: 



PCR-A: 



40 



[0047] 



pf mi 

BOLI 1 5 1 -GGGAATTCCAATAGGTGGTTAGCAATCG 



(SEQ. ID NO. 9) 



45 



(BstEII) 

BOLI 4 5 ' -GACCAACG TGGTCGCC TGGCAAAACG 



(SEQ. ID NO. 10) 



50 



55 
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PCR-B: 
[0048] 

5 (BstEII) 

BOLI 3 5 ' -CGTTTTGCC AGGCGACC ACGTTGGTC (SEQ. ID NO. 11) 

Aflll 

10 BOLI 2 5 ' -CCCCAAGCTTACATGG TCTTAAG TT GGCGT (SEQ. ID NO. 12) 



[0049] PCR-A was performed with primers BOL1 1 and BOLI 4 and resulted in an about 130bp fragment with the 
15 PfM restriction site at the 3'-end and the inactivated SsfEII site at the 5'-end. PCR-B was performed with primers BOLI 
2 and BOLI 3 and resulted in an about 290bp fragment with the AflU site at the 5'-end. The third PCR was with the frag- 
ments obtained from reaction A and B, together with the primers BOL1 1 and BOLI 2. 

[0050] Finally, the about 1 .8kb Sac\-Hind\ 1 1 fragment was replaced with synthetic fragments, having sequences as 
presented below, resulting the plasmids pUR4547 and pUR4548, respectively. 

20 

• Sacl/H/hcflll fragment of pUR4547 
[0051] 

25 

Sad (SEQ. ID NO. 13) 
GAGCTCATCACACAAACAAACAAAACAAAATGATGCTTTTGCAAGCCTTCCCTT 
I + ~+ + + + 54 

CTCGAGTAGTGTGTTTGTTTGTTTTGTTTTACTACGAAAACGTTCGGAAGGGAA 
30 MMLLQAFLF 

|-> SUC2 ss (SEQ. ID NO. 14) 

PstI 

35 TTCGTTTTGGCTGGTTTTGCAGCCAAAATATCTGCGCAGGTGCAGCTGCAGG 

55 + + + + + 1Q 5 

AAGGAAAACCGACCAAAACGTCGGTTTTATAGACGCGTCCACGTCGACGTCC 
LLAGFAAKISAQVQLQE 

BstEII Hindlll 
AGTCATAATGAGGGACCCAGGTCACCGTCTCCTCATAATGACTTAAGCTT 

106 + + + + + 155 

45 TCAGTATTACTCCCTGGGTCCAGTGGCAGAGGAGTATTACTGAATTCGAA 
ES**GTQVTVSS** 
HC-V cassette <-\ 

50 

and 



55 
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- SacVHind\\\ fragment of pUR4548 
[0052] 



Sad (SEQ ' 10 N °" 15) 

GAGCTCATCACACAAACAAACAAAACAAAATGATGCTTTTGCAAGCCTTCCTTT 

1 + + + + + 54 

CTCGAGTAGTGTdTTTGTTTGTTTTGTTTTACTACGAAAACGTTCGGAAGGAAA 

~" MMLLQAFLF 

|_> SUC2 ss (SEQ. ID NO. 16) 



15 



20 



30 



35 



40 



45 



50 



55 



PstT 

TCCTTTTGGCTGGTTTTGCAGCCAAAATATCTGCGCAGGTGCAGCTGCAGG 

55 + + + + + 

AGGAAAACCGACCAAAACGTCGGTTTTATAGACGCGTCCACGTCGACGTCC 

LLAGFAAKISAQVQLQE 



105 



BstEII 

AGTCATAATGAGGGACCCAGGTCACCGTCTCCTCAGAACAAAAACTCATC 

106 tcagta™ctccctgggtccagtg«^gaggagtcttgtttttgagtag 
s**gtqvtvsseqkli 

HC-V cassette <-!-» m y c tail 



155 



Hindi I I 

TCAGAAGAGGATCTGAATTAATGACTTAAGCTT 

156 + + + 188 

AGTCTTCTCCTAGACTTAATTACTGAATTCGAA 
SEEDLN** 



[0053J Both plasmids contain the GAL7 promoter and PGK terminator sequences as well as the "™^UCS>> 
stanal seouence In both plasmids the DNA sequence encoding the SUC2 signal sequence b followed by *e first 5 
SisTncS "SEE* of the HC-V domain Oncluding the Ml site), a stuffer sequ^ce. the las. ax codons 
Sn^.S-S)dth7HC-Vd 

In pUP.4548. this sequence is followed by eleven codons encoding the myc-tag. two stop codons, an Af/ll and HrndW 

[00541 Plasmids pUR4547 and pUR4548 were deposited under the Budapest Treaty at the Centraal Bureau voor 
ELZZ T*m on 18th August 1997 wHh deposition numbers: CBS ,00012 

In accordance with Rule 28(4) EPC, or a similar arrangement from a state not tang a contracting state of the EPC. it 
is hereby requested that a sample of such deposit when requested, will be submitted to an expert only 
00551 Upon digesting pUR4548 with Psfl and SsfEII. the about 6.4kb vector fragment was solated and leafed w.th 
h^sn-BsfETfragments of about 350bp obtained as described above. After 

troporation, transformants were selected from minimal medium agar plates (compnsing 0.7% yeast nrtrogen base. 2% 
glucose and 2% agar, supplemented with the essential amino acids and bases). 
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iii) Screening for antigen specific HC-V domains 

[0056] For the production of llama HC-V fragments with myc-tail. individual transformants were grown overnight in 
selective minimal medium (comprising 0.7% yeast nitrogen base, 2% glucose, supplemented with the essential amino 

5 acids and bases) and subsequently diluted ten times in YPGal medium (comprising 1 % yeast extract, 2% bacto pepton 
and 5% galactose). After 24 and 48 hours of growth, the culture supernatant of the colonies was analysed by ELISA for 
the presence of HC-V fragments which specifically bind to the antigens hCQ, RR6 in essential the same way as 
described above. In this case, however, the presence of specifically bound HC-V fragments was detected by incubation 
with monoclonal anti-myc antibodies, followed by incubation with poly-clonal rabbit-anti-mouse conjugate with alkaline 

10 phosphatase. In this way a number of anti-hCG and anti-RR6 HC-V fragments were isolated, which are: 

anti-RR6: 

R9 pUR4640 (seeFigurel) (SEQ. ID NO. 17. 18) 

15 

anti-hCG (alpha unit): 

H14 pUR4601 (see Figure 2) (SEQ. ID NO. 19, 20) 

20 c) Production of llama HC-V biheads by S. cerevisiae 

i> Construction of episomal expression plasmids encoding anti-hCG/anti-RR6 bispecH ic biheads 

[0057] In the anti-hCG HC-V fragment H14 (anti-alpha-subunit), the Pst\ site was removed and a Xhol site was 
25 introduced via PCR, using the primers: 



MPG158WB 

Xhol 

5 ' -GAATTMGCGGCCGCCCAGGTGAAACTGCTCGAGTCWGGGGGA-- 3 ' (SEQ. ID NO, 21) 



and 

35 

MPG159WB 

BstEII 

3 ' -CCCTGGGTCCAGTGGCAGAGGAGTGGCAGAGGAGTCTTGTTT-5 ' (SEQ. ID NO. 22) 

40 



[0058] In this way the sequence: 

45 

Pstl 

CAG GTC CAG CTG CAG GAG TCT GGG (SEQ. ID NO. 23) 

QVQLQESG 

50 

became 



Xhol 

CAG GTG AAA CTG CTC GAG TCW GGG 
QVKLLESG 



(SEQ. ID NO. 24) 
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10 



20 



25 



[0m UpondlgetfngthePCRfragm^ 

l Z gel electrophoresis and isolation from the gel. The fragments «M ££££ £SS*p Eaol - 
94/25591)which was digested w^ 

HMI II fragment of pJS2 was isolated and ligated in MUUM agl- ^i^SSsT, was digested with 
of which the Pst\ and BsrEH sites were remand las ^^^^^^^^^^M^h^^B 
BsfEII and HMM. after which the purified vector fragment was religated in the presence or sy 

following sequence: 

D CJ -T Hindlll 
Bs tEII ^" 49> _> (SEQ ID NO- 25) 

gScACCCTCTCCTC^^ 55 

' <_ • MPG 161 WB (4B) / 

VTVSSQVQLQESL* *LKL 

(SEQ. ID NO. 26) 

mid pUR4619 encodes a anti-hCG-anti.RR6 ^^« b, ^^!l^^ 2 7 28) 
L . i ~i iDActo- <a ir? - H14 - R9 - myc (see Figures 3-4/SEQ. ID NO. 27. 

ii) Production o1 the HC-V biheads 

kele<«cl Ire™ mini™i medium aga. plales as describecl n part ^^V^^ m ypoa medium. MM 

rabbit-anti-mouse conjugate with alkaline phosphatase. 

d) Anti-hCGtenti-RR6 bispecific biheads containing a linker peptide 

i) Cordon o,S.c^ 
ing a linker peptide 

' ft rrrssrr.TS'^asOTSSTWs 
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MVaJA 

BstELl Xbal Oralll PstI Hindlll 

5' CSTCACCGTCTCTAGATGGCCACCAGGnPGCAGCTGCAGGAGTCAACTTA 3' (SEQ. ID NO. 29) 



MVbJA 



[0064] This resulted in pSJ7a. in this plasmid the about 20 bp Pst\-Hind\\\ fragment was replaced with the about 
370 bp Pst\-Hind\\\ fragment encoding the anti-RR6 HC-V fragment R9 and/with the myc-tail of pUR4G40 (see Exam- 
ple 1 c(i)) and resulting in pSJ7b. 

[0065] Upon digesting plasmid pSJ7b with Xbal and Dra\\\ the about 7 kb vector fragment was ligated with five syn- 
thetic oligo nucleotide linker fragments presented below: 



MV01JA 5' CTAGTGGTACTTCCGGTTCCCAG 3' (SEQ. ID NO; 31) 



MV02JA 3' ACCATGAAGGCCAAGG 5' (SEQ. ID NO. 32) 



GCAGAGATCTACCGGTGGTCCACGTCGAGCTCCTCAGTTGAATTCGA. 5' 



(SEQ. ID NO. 30) 



S G T S G S Q 



MV03JA 



5' CTAGTTCTTCATCTGCTTCTGCCTCTTCAGCCCAG 3' 



(SEQ. ID NO. 33) 



MV04JA 



AAGAAGTAGACGAAGACGGAGAAGTCGG 5' 



(SEQ. ID NO. 34) 



SS5SASASSAQ 



MV05JA 



5'CTAGTGGTTCTCCAGGTTCACCAGGTCAG 3' 



(SEQ. ID NO. 35) 



MV06JA 



3' 



ACCAAGAGGTCCAAGTGGTCCA 5' 



(SEQ. ID NO. 36) 



SGSPGSPGQ 
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MV07JA 5' CTAGTGCTACTACAACTGGTTCTTCACCAGGTCCAACTCAG 3' 

(SEQ. ID NO. 37) 

MV08JA 3' ACGATGATGTTGACCAAGAAGTGGTCCAGGTTGA 5' 

(SEQ. ID NO. 38) 
SATTTGSSPGPTQ 

MV09JA 5' CTAGTGCTAATCATTCTGGTAATGCTTCTCAG 3' (SEQ. ID NO. 39) 



MV10JA 3' ACGATTAGTAAGACCATTACGAAGA 5' (SEQ. ID NO. 40) 



SANHSGNASQ 



[0066] The oligonucleotide linker fragments encode the last amino acid of the N-terminal HC-V fragment (S>and 
the first amino acid of the C-terminal HC-V fragment, intersected by the connecting linker peptide. This resulted in plas- 
mids pUR5330 to 5334, respectively. 

[0067] After transformation of S. cerevisiae with these plasmids, the production levels of the biheads were deter- 
mined via Western Wot analysis and a anti-hCG ELISA using anti-myc mAb for detection of the bound bihead (see 
Example 1 b(iii). Production levels are presented in Table 2 below: 



Table 2 



Plasmid 


Linker 


Production level (mg/l) 


PUR4619 


None 


11 


PUR5330 


S-G-T-S-G-S-Q 


36 


PUR5331 


S-S-S-S-A-S-A-S-S-A-Q 


49 


pUR5332 


S-G-S-P-G-S-P-G-Q 


33 


pURS333 


S-A-T-T-T-G-S-S-P-G-P-T-Q 


56 


PUR5334 


S-A-N-H-S-G-N-A-S-Q 


51 



[0068] The production levels of the biheads in which the two HC-V domains are separated by a linker peptide (con- 
sisting of between 5 and 1 1 amino acids) were found to be 3 to 5 times higher as found for the bihead in which the two 
HC-V fragments are connected without a peptide linker. 
[0069] Finally, the bispecif icity of the biheads was demonstrated as follows: 

[0070] PINs coated with hCG were incubated with (diluted) medium samples. Subsequently, the PINs were incu- 
bated with a RR6-alkaline phosphatase conjugate, in which the azo-dye RR6 was coupled to the alkaline phosphatase 
via its reactive triazine group. Finally, the alkaline phosphatase enzyme activity was determined after incubation of the 
PINs with p-nitro-phenyl phosphate and the optical density was measured at 405nm (see Figure 5). 

1.2 Purification of Llama Bi-heads with Various Linkers from S. cerevisiae Culture Media 

[0071] A 5 ml column of recombinant Protein A Fast Flow Sepharose (Amersham Pharmacia Biotech) was equili- 
brated by washing with 10 column volumes of wash buffer (10 mM potassium phosphate, pH 6), at a flow rate of 2 
ml/min. The bi-head fermentation broth was loaded at 2 ml/min in an upwards direction. After loading, the column was 
washed with wash buffer until the OD 280 reached the baseline. Elution was carried out with a linear gradient of 0 - 40 
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mM citric acid pH 2.5 in the reverse direction, collecting 4 ml fractions into tubes containing 400 uJ of a neutralising 
agent (1 M Tris.CI, pH 8.5) in order to minimise the effects of the acid. Peak fractions were checked for purity by running 
on a 12% SDS-PAGE Ready Gel (Bio-Rad) under standard denaturing conditions. Staining was with GelCode Blue 
(Pierce & Warriner). The fractions were concentrated using Macrosep centrifugal concentrators (3 kDa molecular 
5 weight cut-off, Pall Filtron Corp.) then buffer exchanged into 10mM potassium phosphate, pH 6 using PD-10 columns 
(Amersham Pharmacia Biotech). The final purity of the sample was determined by carrying out a UV scan from 400 - 
220 nm and using the value at 280 nm to determine an accurate concentration. The samples were then aliquoted into 
vials, frozen, freeze dried and stored until required. 

w 1 .3 Preparation of a Reactive Red 6-Bcvine Serum Albumin Conjugate 

[0072] A solution of Reactive Red 6 (RR6) was made up at 1 0 mg/ml in phosphate buffered saline (PBS). A solution 
of bovine serum albumin (BSA) was made up at 10 mg/ml in PBS. 200 ul of the RR6 solution was added to 800 ul of 
the BSA solution and the resulting solution was mixed in an end over end rotary mixer for 2 hours at room temperature. 
is RR6 that had conugated to BSA was separated from free RR6 by addition of the reaction mixture (1 ml) to a PD1 0 col- 
umn (Pharmacia) previously washed with 10 ml of PBS containing 0.1% sodium azide (PBSA). The column was then 
eluted by addition of PBSA (5 ml) and 1 ml aliquots were collected. The RR6-BSA conjugate eluted in fractions 4 and 
5. These were pooled and the concentration of protein was determined using a BCA protein test and the concentration 
adjusted to 2 mg/ml with PBSA. 

20 

1 .4 Adsorption of Latex with Reactive Red 6-Bovine Serum Albumin Conjugate 
[0073] Duke blue latex was adsorbed with the RR6-BSA conjugate as follows: 

[0074] To 950 jil of 10 mM borate buffer, 0.01 % merthiolate, pH 8.5 (buffer B) a 50 uJ aliqot of Duke blue latex (10 
25 % solids) was added and mixed by inverting. The diluted latex was centrifuged at 8000 g for 10 minutes at room tem- 
perature, the supernatant removed and the pellet vortexed briefly. The pellet was resuspended in 900 uJ of buffer B and 
to this 1 00 \jS of the previously prepared RR6-BSA conjugate was added. The latex solution was sonicated for 1 0 s using 
a sonic probe. The solution containing the latex was mixed for 2 h at room temperature and then centrifuged (8000 g, 
10 min at room temperature). The latex pellet was washed by resuspending in 1 ml of buffer B and centrifuged once 
30 more (8000 g, 10 min at room temperature). The pellet was then resuspended in 1 ml buffer B ready for use. 

1 .5 Analysis of Uama Bi-head Self Assembling on Reactive Red 6-Bovine Serum Albumin Adsorbed Latex 

[0075] The llama bi-heads were tested by self assembling onto RR6-BSA adsorbed latex and detection of hCG in 
35 a rapid assay technology (RAT) format. This was performed by mixing the llama bi-head (5 ul of a 0.1 mg/ml solution) 
with RRS^BSA adsorbed latex (5 ^l of 0.1 % solids) in 10 uJ of PBSA to which hCG (5 uJ of various concentrations) was 
added. 

[0076] The resulting solution was added to the bottom of a nitrocellulose strip (6 mm wide x 30 mm long) on which 
a monoclonal antibody recognising hCG had been adsorbed by plotting in a line (2.5 mg/ml) mid way up the strip. The 
40 latex-bi-head-hCG solution was allowed to flow up the nitrocellulose strip by capillary action and the strip was then 
washed by applying PBSA (25 pi) to the bottom of the strip. The amount of latex, captured at the plotted antibody line 
on the nitrocellulose strip, was quantified by measuring the absorbance through the strip. 

[0077] Figure 6 shows that the llama bi-heads with linkers 3, 4 and 5 gave the highest response in RAT assays. 
These linkers are structurally more ordered than the comparative examples, flexible linkers 2 and 6 and result in more 
4$ hCG and more latex captured in the assay. The more ordered linkers promote the correct orientation of the binding 
domains to achieve more optimal binding than when no linker is used. Linker 3 is derived from CWP1 and Linker 5 from 
CBH1P. 

[0078] Synthetic linkers with some order (linker 4 containing 2 proline residues) can offer increased sensitivity in 
assays than those with little order (linker 2). Figure 7 shows that the bi-head with linker 4 can detect lower amounts (50 
so mlU/ml) of hCG than the bi-head with linker 2 and, hence, give a more sensitive assay for hCG. 
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SEQUENCE LISTING 



<110> UNILEVER PLC 
UNILEVER N.V 

<120> ANTIGEN BINDING PROTEINS 
<130> T3077 

<140> 
<141> 

<160> 40 

<170> Patentln Ver. 2.1 

<21D> 1 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : LINKER 
<400> 1 

Gly Thr Ser Gly Ser 
1 s 



<210> 2 
<211> 9 
<212> PRT 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : LINKER 



Ser Ser Ser Ala Ser Ala Ser Ser Ala 
1 5 



<210> 3 
<211> 1 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 3 

Gly Ser Pro Gly Ser Pro Gly 
1 5 



<210> 4 
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<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 4 

Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr 
1 5 10 



<210> 5 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 5 

Ala Asn His Ser Gly Asn Ala Ser 
1 5 



<210> 6 
<211> 22 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 6 

aggtsmarct gcagsagtcw gg 



<210> 7 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 7 

aacagttaag cttccgcttg cggccgcgga gctggggtct tcgctgtggt gcg 

<210> 8 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 8 
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aacagttaag cttccgcttg cggccgctgg ttgtggtttt ggtgtcttgg gtt 



<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 9 

gggaattcca ataggtggtt agcaatcg 



<210> 10 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<40D> 10 

gaccaacgtg gtcgcctggc aaaacg 



<210> 11 
<2I1> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 11 

cgttttgcca ggcgaccacg ttggtc 



<210> 12 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PRIMER 
<400> 12 

ccccaagctt acatggtctt aagttggcgt 



<210> IS 
<211> 1S5 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PLASMID 
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CONSTRUCT 

<220> 

<221> CDS 

<222> (3) ..(140) 

<400> 13 

ga get cat cac aca aac aaa caa aac aaa atg atg ctt ttg caa gec 47 
Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala 
15 10 15 

ttc ctt ttc ctt ttg get ggt ttt gca gee aaa ata tct gcg cag gtg 95 
Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys He Ser Ala Gin Val 
20 25 30 

cag ctg cag gag tea taa tga ggg acc cag gtc acc gtc tec tea 140 
Gin Leu Gin Glu Ser Gly Thr Gin Val Thr Val Ser Ser 

35 40 45 

taatgactta agctt 155 



<210> 14 
<211> 46 
<212> PRT 

<213> Artificial Sequence 
25 <223> Description of Artificial Sequence :PLASMID 

CONSTRUCT 

<400> 14 

Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala Phe 
15 10 15 



Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys lie Ser Ala Gin Val Gin 
20 25 30 

Leu Gin Glu -Ser Gly Thr Gin Val Thr Val Ser Ser 

35 40 45 



<210> 15 
<211> 188 
<212> DNA 

40 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PLASMID 
CONSTRUCT 

45 <220> 

<221> CDS 

<222> (3).. (173) 

<400> 15 

50 ga get cat cac aca aac aaa caa aac aaa atg atg ctt ttg caa gec 47 

Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala 
15 10 15 
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tte ctt tec ctt ttg get «t «t gca gec aaa ata tct gcg eg «g 
Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys lie ser *ua 
20 25 

s a a as s " * a s a s s s a s a 

?c 40 



caa aaa etc ate tea gaa gag gat ctg aat taatgaetta agctt 
Gin Lys Leu lie Ser Glu Glu Asp Leu Asn 



95 



143 



188 



SO 



<210> 16 
<211> 57 
<212> PRT 

<213> Artificial Sequence „T*euTrv 
<223> Description of Artificial Sequence : PLASMID 
CONSTRUCT 

2? ^"His T hr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala Phe 

1 5 
Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys lie Ser Ala Gin Val Gin 

20 25 
Leu Gin Glu Ser Gly Thr Gin Val Thr Val Ser ser Glu Gin 

35 40 



Lys Leu lie Ser Glu Glu Asp Leu Asn 
50 55 



<210> 17 
<211> 342 
<212> DNA 

<213> Artificial Sequence 

S Description of Artificial Sequence rPIASMID 
CONSTRUCT 

<220> 

<221> CDS 

<222> (IK- (342) 



ass a 2 a a a a a a 2 a a s a a >° 

2 2 S £ 2 S K K S S a S S S S " 

20 25 

s s a s s a a s a a a a s a a s " 4 
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35 40 45 



gca acc att aat agt aga ggt ate aca aac tat gca gac ttc gtg aag 192 
Ala Thr lie Asn Ser Arg Gly He Thr Asn Tyr Ala Asp Phe Val Lys 
50 55 60 

ggc cga ttc acc ate tec aga gac aat gee aag aag aca gtg tat ttg 240 
Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu 
65 70 75 80 

gaa atg aac age ctg gaa cct gaa gac acg gec gtt tat tac tgt tac 288 
Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr 
95 90 95 

act cac tac ttc aga tec tac tgg ggt cag ggg acc cag gtc acc gtc 336 
Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr Val 
100 105 110 

tec tea 342 
Ser Ser 



<210> 18 
<211> 114 
<212> PRT 

<213> Artificial Sequence 
25 <223> Description of Artificial Sequence: PLASMID 

CONSTRUCT 

<400> 18 

Gin Val Gin Leu Gin Glu Ser Gly Gly G).y Leu Val Gin Ala Gly Glu 
15 10 15 

30 

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly 
20 25 30 

Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu Val 
35 40 45 

Ala Thr He Asn Ser Arg Gly lie Thr Asn Tyr Ala Asp Phe Val Lys 
50 55 60 

Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu 
65 70 75 80 

Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr 
85 90 95 

Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr Val 
45 100 105 110 

Ser Ser 



<210> 19 
<211> 351 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence :PLASMID 
s CONSTRUCT 

<220> 
<221> CDS 
<222> (1).. (351) 

10 <400> 19 

cag gtg cag ctg cag gag tea ggg gga gga ttg gtg cag gcg ggg ggc 
Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly 
15 10 15 



75 



20 



50 



55 



gtc acc gtc tec tea 
Val Thr Val Ser Ser 
115 



40 <210> 20 

<211> 117 



<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence :PLASMID 
CONSTRUCT 

<400> 20 

Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly 
1 5 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr 
20 25 30 

Asp Met Gly Trp Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val 



48 



96 



tct ctg aga etc tec tgt gca gec tct gga cgc acc ggc agt acg tat 
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr 
20 25 30 

gac atg ggc tgg ttc cgc cag get cca ggg aag gag cgt gag tct gta 144 
Asp Met Gly Trp Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val 
35 40 45 

gca get att aac tgg gat agt gcg cgc aca tac tat gca age tec gtg 192 
Ala Ala lie Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val 
50 55 60 

25 agg ggc cga ttc acc ate tec aga gac aac gee aag aag acq gtg tat 

Arg Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
65 70 75 80 

ctg caa atg aac age ctg aaa cct gag gac acg gee gtt tat acc tgt 
Leu Gin Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys 
30 85 90 95 

ggc gcg ggg gaa ggt ggt act tgg gac tec tgg ggc cag ggg acc cag 336 
Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin 
100 105 HO 
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35 40 45 

Ala Ala lie Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val 
50 55 60 

Arg Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
65 70 75 80 

Leu Gin Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys 
85 90 95 

Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin 
100 105 110 

Val Thr Val Ser Ser 
115 



<210> 21 
<211> 43 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PRIMER 

25 <400> 21 

gaattaagcg gccgcccagg tgaaactgct cgagtcwggg gga 43 

<210> 22 
<211> 42 
30 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PRIMER 



<400> 22 

ccctgggtcc agtggcagag gagtggcaga ggagtcttgt tt 42 



<210> 23 
40 <211> 24 



<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 23 

caggtccagc tgcaggagtc tggg 24 



<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 24 

caggtgaaac tgctcgagtc wggg 



<210> 25 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER; DOUBLE 
STRANDED 

<220> 
<221> CDS 
<222> (2)..(55T 

<400> 25 

g gtc acc gtc tec tea cag gtg cag ctg cag gag tea ctg taa tga ctt 49 
Val Thr Val Ser Ser Gin Val Gin Leu Gin Glu Ser Leu Leu 
1 5 10 15 

aag ctt S5 
Lys Leu 



<210> 26 
<211> 18 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: LINKER; DOUBLE 
STRANDED 

<400> 26 

Val Thr Val Ser Ser Gin Val Gin Leu Gin Glu Ser Leu Leu 
15 10 15 

Lys Leu 



<210> 27 
<211> 672 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PLASWID 
CONSTRUCT 

<220> 
<221> CDS 
<222> (!)..( 672) 
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<400> 27 

etc gag tea ggg gga gga ttg gtg cag gcg ggg ggc tct ctg aga etc 48 
Leu Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly Ser Leu Arg Leu 
1 5 10 15 

tec tgt gca gee tct gga cgc ace ggc agt acg tat gac atg ggc tgg 96 
Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp 
20 25 30 

ttc cgc cag get cca ggg aag gag cgt gag tct gta gca get att aac 144 
Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala He Asn 
35 40 45 

tgg gat agt gcg cgc aca tac tat gca age tec gtg agg ggc cga ttc 192 
Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe 
50 55 60 

ace ate tec aga gac aac gee aag aag acg gtg tat ctg caa atg aac 240 
Thr lie Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gin Met Asn 
65 70 75 80 

age ctg aaa cct gag gac acg gec gtt tat ace tgt ggc gcg ggg gaa 288 
Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu 
85 90 95 

25 ggt ggt act tgg gac tec tgg ggc cag ggg ace cag gtc acc gtc tec 336 

Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin Val Thr Val Ser 
100 105 110 

tea cag gtg cag ctg cag gag tea ggg gga ggc ttg gtg cag get ggg 384 
Ser Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly 
30 115 120 ^ 125 

gag tct ctg aaa etc tec tgt gca gee tct gga aac acc ttc agt ggc 432 
Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly 
130 135 140 

35 . 

ggc ttc atg ggc tgg tac cgc cag get cca ggg aag cag cgc gag ttg 480 
Gly Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu 
145 150 155 160 

gtc gca acc att aat agt aga ggt ate aca aac tat gca gac ttc gtg 528 
40 Val Ala Thr He Asn Ser Arg Gly He Thr Asn Tyr Ala Asp Phe Val 

165 170 175 

aag ggc cga ttc acc ate tec aga gac aat gee aag aag aca gtg tat 576 
Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
u 180 185 190 

ttg gaa atg aac age ctg gaa cct gaa gac acg gee gtt tat tac tgt 624 
Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys 
195 200 205 

so tac act cac tac ttc aga tec tac tgg ggt cag ggg acc cag gtc acc 672 

Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr 
210 215 220 
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<210> 28 
<211> 224 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence -.PLASMID 
CONSTRUCT 

<400> 28 

Leu Glu Ser Gly Gly Gly Leu Val Gin Ala Giy Gly Ser Leu Arg Leu 
15 10 15 

Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp 
20 25 30 

Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala He Asn 
35 40 45 

Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe 
50 55 60 

Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gin Met Asn 
65 70 75 80 

Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu 
85 90 95 

Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin Val Thr Val Ser 
100 105 HO 

Ser Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly 
115 120 . 125 

Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly 
130 135 140 

Gly Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu 
145 t 150 155 160 

Val Ala Thr He Asn Ser Arg Gly He Thr Asn Tyr Ala Asp Phe Val 
165 170 175 

Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
180 185 190 

Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys 
195 200 205 

Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr 
210 215 220 



<210> 29 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : SYNTHETIC 
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INSERT 
<400> 29 

gtcaccgtct ctagatggcc accaggtgca gctgcaggag tcaactta 48 



<210> 30 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : SYNTHETIC 
INSERT 

<400> 30 

gcagagatct accggtggtc cacgtcgagc tcctcagttg aattcga 47 



<210> 31 
20 <211> 23 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 31 

ctagtggtac ttccggttcc cag 23 



<210> 32 
30 <211> 16 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 
<400> 32 

accatgaagg ccaagg 16 

40 <210> 33 

<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Description of Artificial Sequence: LINKER 

<400> 33 

ctagttcttc atctgcttct gcctcttcag cccag 35 



<210> 34 
<211> 28 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 34 

aagaagtaga cgaagacgga gaagtcgg 



<210> 35 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 35 

ctagtggttc tccaggttca ccaggtcag 



<210> 36 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 36 

accaagaggt ccaagtggtc ca 



<210> 37 
<211> 41 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : LINKER 
<400> 37 

ctagtgctac tacaactggt tcttcaccag gtccaactca g 



<210> 38 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 38 

acgatgatgt tgaccaagaa gtggtccagg ttga 



<210> 39 
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<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 39 

ctagtgctaa tcattctggt aatgcttctc ag 



<210> 40 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 
<400> 40 

acgattagta agaccattac gaaga 
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SEQUENCE LISTING 



<110> UNILEVER PLC 
UNILEVER N.V 

<120> ANTIGEN BINDING PROTEINS 

<130> T3077 

<140> 
<141> 

<160> 45 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : LINKER 
<400> 1 

Gly Thr Ser Gly Ser 
1 5 



<210> 2 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 

<400> 2 ~ 
Ser Ser Ser Ala Ser Ala Ser Ser Ala 
1 5 



<210> 3 
<211> 1 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 3 

Gly Ser Pro Gly Ser Pro Gly 
1 5 



<210> 4 
<211> 11 
<212> PRT 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: PRIMER 
<400> 4 
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Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr 
15 10 



<210> 5 
<211> 8 
<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> Description of Artificial Sequence : PRIMER 



15 



20 



30 



<400> 5 

Ala Asn His Ser Gly Asn Ala Ser 
1 5 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 6 

aggtsmarct gcagsagtcw gg 22 

<210> 7 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 7 

aacagttaag cttccgcttg cggccgcgga gctggggtct tcgctgtggt gcg 53 

35 <210> 8 

<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

40 <223> Description of Artificial Sequence: PRIMER 

<400> 8 

aacagttaag cttccgcttg cggccgctgg ttgtggtttt ggtgtcttgg gtt 53 

45 <210> 9 

<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 9 

gggaattcca ataggtggtt agcaatcg 28 



55 
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<210> 10 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 10 

gaccaacgtg gtcgcctggc aaaacg 

<210> 11 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 11 

cgttttgcca ggcgaccacg ttggtc 

<210> 12 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PRIMER 
<400> 12 

ccccaagctt acatggtctt aagttggcgt 

<210> 13 
<211> 155 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PLASMID 
CONSTRUCT 

<220> 

<221> CDS 

<222> (3).. (110) 

<220> 

<221> CDS 

<222> (117).. (140) 

<220> 

<221> CDS 

<222> (14?>..(155> 

2°St l Lt cac aca aac aaa caa aac aaa atg atg ctt ttg caa gcc 
* Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala 
I 5 10 

ttc ctt ttc ctt ttg get ggt ttt gca gcc aaa ata tct gcg cag gtg 
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Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys lie Ser Ala Gin Val 
20 25 30 

cag ctg cag gag tea taatga ggg acc cag gtc acc gtc tec tea taatga 146 
Gin Leu Gin Glu Ser Gly Thr Gin Val Thr Val Ser Ser 

35 40 

ctt aag ctt 155 
Leu Lys Leu 
45 



<210> 14 
<211> 36 
<212> PRT 

75 <213> Artificial Sequence 

<223> Description of Artificial Sequence :PLASMID 
CONSTRUCT 



<400> 14 

Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala Phe 
1 5 10 15 

Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys He Ser Ala Gin Val Gin 
20 25 30 

Leu Gin Glu Ser 
35 



<210> 15 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
30 <223> Description of Artificial Sequence: PLASMID 

CONSTRUCT 

<400> 15 

Gly Thr Gin Val Thr Val Ser Ser 
1 5 

35 

<210> 16 
<211> 3 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: PLASMID 
40 CONSTRUCT 

<400> 16 
Leu Lys Leu 
1 



<210> 17 
<2U> 188 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PLASMID 
CONSTRUCT 

<220> 
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<221> CDS 
<222> (3).. (110) 

<220> 

<221> CDS 

<222> {117).. (173) 

<220> 

<221> CDS 

<222> (180).. (188) 

<400> 17 

ga get cat cac aca aac aaa caa aac aaa atg atg ctt ttg caa gec 
Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala 
15 10 15 

ttc ctt ttc ctt ttg get ggt ttt gca gec aaa ata tct gcg cag gtg 
Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys He Ser Ala Gin Val 
20 25 30 

cag ctg cag gag tea taatga ggg acc cag gtc acc gtc tec tea gaa 
Gin Leu Gin Glu Ser Gly Thr Gin Val Thr Val Ser Ser Glu 

35 40 45 

caa aaa etc ate tea gaa gag gat ctg aat taatga ctt aag ctt 
Gin Lys Leu He Ser Glu Glu Asp Leu Asn Leu Lys Leu 

50 55 



<210> 18 
<211> 36 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PLASMID 
CONSTRUCT 

<400> 18 

Ala His His Thr Asn Lys Gin Asn Lys Met Met Leu Leu Gin Ala Phe 
15 10 15 

Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys He Ser Ala Gin Val Gin 
20 25 30 

Leu Gin Glu Ser 
35 



<210> 19 
<211> 19 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PLASMI D 
CONSTRUCT 

<400> 19 

Gly Thr Gin Val Thr Val Ser Ser Glu Gin Lys Leu He Ser Glu Glu 
15 10 15 

Asp Leu Asn 



<210> 20 
<211> 3 
<212> PRT 
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<213> Artificial Sequence 

<223> Description of Artificial Sequence: PLASMID 
CONSTRUCT 

<400> 20 
Leu Lys Leu 
1 



<210> 21 
<211> 342 
<212> DNA 

<213> Artificial Sequence 

15 <220> 

<223> Description of Artificial Sequence: PLASMID 
CONSTRUCT 

<220> 
<221> CDS 
<222> (1)..(342) 

20 

<400> 21 

cag gtg cag ctg cag gag tea ggg gga ggc ttg gtg cag get ggg gag 48 
Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Glu 
15 10 15 

25 tct ctg aaa etc tec tgt gca gee tct gga aac ace ttc agt ggc ggc 96 

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly 
20 25 30 

ttc atg ggc tgg tac cgc cag get cca ggg aag cag cgc gag ttg gtc 144 
Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu Val 
35 40 45 

30 

gca acc att aat agt aga ggt ate aca aac tat gca gac ttc gtg aag 192 
Ala Thr lie Asn Ser Arg Gly He Thr Asn Tyr Ala Asp Phe Val Lys 
50 55 60 

ggc cga ttc acc ate tec aga gac aat gee aag aag aca gtg tat ttg 240 
35 Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu 

65 70 75 80 

gaa atg aac age ctg gaa cct gaa gac acg gee gtt tat tac tgt tac 288 
Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr 
85 90 95 



act cac tac ttc aga tec tac tgg ggt cag ggg acc cag gtc acc gtc 336 
Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin' Gly Thr Gin Val Thr Val 
100 105 110 

tec tea 342 
Ser Ser 



<210> 22 
<211> 114 
<212> PRT 

<213> Artificial Sequence 
50 <223> Description of Artificial Sequence: PLASMID 

CONSTRUCT 

<400> 22 

Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Glu 
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15 10 15 

5 Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly Gly 

20 25 30 

Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu Val 
35 40 45 

Ala Thr lie A3n Ser Arg Gly lie Thr Asn Tyr Ala Asp Phe Val Lys 
10 50 55 60 

Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu 
65 70 75 80 



15 



20 



25 



30 



35 



40 



45 



Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys Tyr 
85 90 95 

Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr Val 
100 105 110 

Ser Ser 



<210> 23 
<211> 351 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PLASMID 
CONSTRUCT 

<220> 
<221> CDS 
<222> (1)..(351) 

<400> 23 

cag gtg cag ctg cag gag tea ggg gga gga ttg gtg cag gcg ggg ggc 48 

Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly 

15 10 15 

tct ctg aga etc tec tgt gca gec tct gga cgc acc ggc agt acg tat 96 
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr 
20 25 30 

gac atg ggc tgg ttc cgc cag get cca ggg aag gag cgt gag tct gta 144 
Asp Met Gly Trp Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val 
35 40 45 

-s. 

gca get att aac tgg gat agt gcg cgc aca tac tat gca age tec gtg 192 
Ala Ala lie Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val 
50 55 60 

agg ggc cga ttc acc ate tec aga gac aac gee aag aag acg gtg tat 240 
Arg Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
65 70 75 80 

ctg caa atg aac age ctg aaa cct gag gac acg gec gtt tat acc tgt 288 
Leu Gin Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys 
85 90 95 

50 ggc gcg ggg gaa ggt ggt act tgg gac tec tgg ggc cag ggg acc cag 336 

Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin 
100 105 110 
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gtc acc gtc tec tea 351 
Val Thr Val Ser Ser 
115 

<210> 24 
<211> 117 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence :PLASMID 
CONSTRUCT 

<400> 24 

Gin Val Gin Leu Gin Giu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr 
20 25 30 

Asp Met Gly Trp Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val 
35 40 45 

Ala Ala lie Asn Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val 
50 55 60 

Arg Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
65 70 75 80 

25 Leu Gin Met Asn Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys 

85 90 95 

Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin 
100 105 110 

30 Val Thr Val Ser Ser 

115 

<210> 25 
<211> 43 
<212> DNA 

<213> Artificial Sequence 



10 



15 



20 



35 



<22Q> 

<223> Description of Artificial Sequence : PRIMER 
<400> 25 

40 gaattaagcg gccgcccagg tgaaactget cgagtcwggg gga 43 

<210> 26 
<2U> 42 
<212> DNA 

45 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : PRIMER 



50 



55 



<400> 26 

ccctgggtcc agtggcagag gagtggcaga ggagtcttgt tt 42 
<210> 27 
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<211> 24 
<212> DNA 

<2X3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PRIMER 
<400> 27 

caggtccagc tgcaggagtc tggg 



<210> 28 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 28 

caggtgaaac tgctcgagtc wggg 



<210> 29 
<211> S5 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER; DOUBLE 
STRANDED 

<220> 
<221> CDS 
<222> (2).. (40) 

<220> 

<221> CDS 

<222> (47).. (55) 

<400> 29 

g gtc acc gtc tec tea cag gtg cag ctg cag gag tea ctg taatga ctt 
Val Thr Val Ser Ser Gin Val Gin Leu Gin Glu Ser Leu Leu 
15 10 

aag ctt 
Lys Leu 
15 



<210> 30 
<211> 13 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : LINKER; DOUBLE 
STRANDED 

<400> 30 

Val Thr Vat Ser Ser Gin Val Gin Leu Gin Glu Ser Leu 
1 S 10 



<210> 31 
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15 



35 



40 



45 



<2ll> 3 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: LINKER; DOUBLE 
STRANDED 



<400> 31 
Leu Lys Leu 
10 1 



<210> 32 
<211> 672 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PLASMID 
CONSTRUCT 

<220> 

20 <221> CDS 

<222> (1)..(672) 

<400> 32 

etc gag tea ggg gga gga ttg gtg cag gcg ggg ggc tct ctg aga etc 48 
Leu Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly Ser Leu Arg Leu 
15 10 15 

25 

tec tgt gca gee tct gga cgc ace ggc agt a eg tat gac atg ggc tgg 96 
Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp 
20 25 30 

ttc cgc cag get cca ggg aag gag cgt gag tct gta gca get att aac 144 
Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala lie Asn 
30 35 40 45 

tgg gat agt gcg cgc aca tac tat gca age tec gtg agg ggc cga ttc 192 
Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe 
50 55 60 



ace ate tec aga gac aac gee aag aag acg gtg tat ctg caa atg aac 240 
Thr lie Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gin Met Asn 
65 70 75 80 

age ctg aaa cct gag gac acg gee gtt tat ace tgt ggc gcg ggg gaa 288 
Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu 
85 90 95 

ggt ggt act tgg gac tec tgg ggc cag ggg acc cag gtc acc gtc tec 336 
Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin Val Thr Val Ser 
100 105 110 

tea cag gtg cag ctg cag gag tea ggg gga ggc ttg gtg cag get ggg 384 
Ser Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly 
115 120 125 

gag tct ctg aaa etc tec tgt gca gec tct gga aac acc ttc agt ggc 432 
Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly 
130 135 140 

50 ggc ttc atg ggc tgg tac cgc cag get cca ggg aag cag cgc gag ttg 480 

Gly Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu 
145 150 155 160 
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gtc gca acc att aat agt aga ggt ate aca aac tat gca gac ttc gtg 
Val Ala Thr He Asn Ser Arg Gly He Thr Asn Tyr Ala Asp Phe Val 
165 170 175 

aag ggc cga ttc acc ate tec aga gac aat gec aag aag aca gtg tat 
Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
180 185 190 

ttg gaa atg aac age ctg gaa cct gaa gac acg gec gtt tat tac tgt 
Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys 
195 200 205 

tac act cac tac ttc aga tec tac tgg ggt cag ggg acc cag gtc acc 
Tyr Thr His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr 
210 215 220 



<210> 33 
<211> 224 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : PLASMIO 
CONSTRUCT 

<400> 33 

Leu Glu Ser Gly Gly Gly Leu Val Gin Ala Gly Gly Ser Leu Arg Leu 
15 10 15 

Ser Cys Ala Ala Ser Gly Arg Thr Gly Ser Thr Tyr Asp Met Gly Trp 
20 25 30 

Phe Arg Gin Ala Pro Gly Lys Glu Arg Glu Ser Val Ala Ala He Asn 
35 40 45 

Trp Asp Ser Ala Arg Thr Tyr Tyr Ala Ser Ser Val Arg Gly Arg Phe 
50 55 60 

Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr Leu Gin Met Asn 
65 70 75 80 

Ser Leu Lys Pro Glu Asp Thr Ala Val Tyr Thr Cys Gly Ala Gly Glu 
85 90 95 

Gly Gly Thr Trp Asp Ser Trp Gly Gin Gly Thr Gin Val Thr Val Ser 
100 105 HO 

Ser Gin Val Gin Leu Gin Glu Ser Gly Gly Gly Leu Val Gin Ala Gly 
115 120 - 125 

Glu Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Asn Thr Phe Ser Gly 
130 135 140 

Gly Phe Met Gly Trp Tyr Arg Gin Ala Pro Gly Lys Gin Arg Glu Leu 
145 150 155 160 

Val Ala Thr He Asn Ser Arg Gly He Thr Asn Tyr Ala Asp Phe Val 
165 170 175 

Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ala Lys Lys Thr Val Tyr 
180 185 190 

Leu Glu Met Asn Ser Leu Glu Pro Glu Asp Thr Ala Val Tyr Tyr Cys 
195 200 205 
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Tyr Thx His Tyr Phe Arg Ser Tyr Trp Gly Gin Gly Thr Gin Val Thr 
210 215 220 

<210> 34 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SYNTHETIC 
INSERT 

<400> 34 

gtcaccgtct ctagatggcc accaggtgca gctgcaggag tcaactta 48 

<210> 35 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SYNTHETIC 
INSERT 

<400> 35 

gcagagatct accggtggtc cacgtcgagc tcctcagttg aattcga 47 

<210> 36 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

30 <220> 

<223> Description of Artificial Sequence : LINKER 

<400> 36 

ctagtggtac ttccggttcc cag 23 



10 



15 



20 



25 



35 



40 



45 



50 



<210> 37 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 
<400> 37 

accatgaagg ccaagg 16 

<210> 38 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 38 
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ctagttcttc atctgcttct gcctcttcag cccag 



<210> 39 
<2U> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 
<400> 39 

aagaagtaga cgaagacgga gaagtcgg 



<210> 40 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 
<400> 40 

ctagtggttc tccagqttca ccaggtcag 



<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 41 

accaagaggt ccaagtggtc ca 



<210> 42 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : LINKER 
<400> 42 

ctagtgctac tacaactggt tcttcaccag gtccaactca g 



<210> 43 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.LINKER 
<400> 43 

acgatgatgt tgaccaagaa gtggtccagg ttga 



<210> 44 
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5 <211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: LINKER 
<400> 44 

ctagtgctaa tcattctggt aatgcttctc ag 32 



15 <210> 45 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> Description of Artificial Sequence: LINKER 
<400> 45 

acgattagta agaccattac gaaga 25 



25 



Claims 

30 1 . Use of a polypeptide group, the amino acid sequence of which group confers restricted conformational flexibility, as 
a linking group to link binding units in a multivalent binding protein. 

2. Use according to claim 1 wherein the polypeptide linking group comprises from 4 to 30 amino acid residues. 

35 3. Use according to claim 1 or 2 wherein the linking group comprises one or more proline residues. 

4. Use according to claim 1 or 2 wherein the linking group comprises an amino acid sequence selected from: 

S-S-S-A-S-A-S-S-A, 
40 G-S-P-G-S-P-G, or 

A-T-T-T-G-S-S-P-G-P-T. 

5. A multivalent binding protein comprising a plurality of binding units linked by means of intervening polypeptide 
linker groups, the amino acid sequence of which linker group confers restricted conformational flexibility. 

45 

6. A protein according to claim 5 wherein the binding units comprise heavy chain variable domains derived from an 
immunoglobulin naturally devoid of light chains. 

7. A protein according to claim 5 or claim 6 wherein the antigen binding units comprise heavy chain variable domains 
so derived from a Camelid immunoglobulin. 

8. A protein according to any one of claims 5 to 7 comprising a bivalent antigen binding protein. 

9. A protein according to any one of claims 5 to 8 wherein the linker group comprises from 4 to 30 amino acid resi- 
55 dues. 

10. A protein according to any one of claims 5 to 9 wherein the linker group comprises one or more proline residues. 
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11. A protein according to any one of claims 5 to 9 wherein the linker group comprises an amino acid sequence 
selected from: 

S-S-S-A-S-A-S-S-A, 
5 G-S-P-G-S-P-G, or 

A-T-T-T-G-S-S-P-G-P-T. 

12. Nucleotide sequences encoding for a multivalent binding protein of any one of claims 5 to 1 1 . 
w 1 3. An expression vector comprising a nucleotide sequence according to claim 1 2. 

14. A host cell transformed with a vector according to claim 13. 
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i 



90 



PstI 

CAGGTGCAGCTGCAGGAGTCAGGGGGAGGCTTGGTGCAGGCTGGGGAGTCTCTGAAACTCTCCTGTGCAGCCTCTGGAAACACCTTCAGT 

+ + ♦ + + + + + + 

GTCCACGTCGACGTCCTCAGTCCCCCTCCGAACCACGTCCGACCCCTCAGAGACT7TGAGAGGACACGTCGGAGACCTTTGTGGAAGTCA 
QVQLQESGGGLVQAGESLKLSCAASGNTFS 

[-> CDR I 

Kpnl 

GGCGGCTTCATGGGCTGGTACCGCCAGGCTCCAGGGAAGCAGCGCGAGTTGGTCGCAACCATTAATAGTAGAGGTATCACAAACTATGCA 

91 + + + + ♦ +— + + + 180 

CCGCCGAAGTACCCGACCATGGCGGTCCGAGGTCCCTTCGTCGCGCTCAACCAGCGTTGGTAATTATCATCTCCATAGTGTTTGATACGT 
GGFMGWYRQAPGKQRELVATI NSRGITNYA 
<-) I-> CDR II 

EagI 

GACTTCGTGAAGGGCCGATTCACCATCTCCAGAGACAA7GCCAAGAAGACAGTGTATTTGGAAATGAACAGCCTGGAACCTGAAGACACG 



181 ♦ ~+ ♦ ♦ ♦ + + ♦ 

CTGAAGCACTTCCCGGCTAAGTGGTAGAGGTCTCTGTTACGGTTCTTC7GTCACATAAACCTTTACTTGTCGGACCTTGGACTTCTGTGC 
DFVKGRFTI SRDNAKKTVYLEMKSLEPEDT 
<-] 

BstEII 

GCCGTTTATTACTGTTACACTCACTACTTCAGATCCTACTGGGGTCAGGGGACCCAGGTCACCGTCTCCTCA 
271 ♦ — ♦ ♦ - + — 342 

CGGCAAATAATGACAATGTGAGTGATGAAGTCTAGGATGACCCCAGTCCCCTGGGTCCAGTGGCAGAGGAGT 
AVYYCYTHYFRSYWGQGTQVTVSS 
(-> CDR III <-] 



270 



Figure 1. 
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PstI 

CAGGTGCAGCTGCRGGAGTCAGGGGGAGGATTGGTGCAGGCGGGGGGCTCTCTGAGACTCTCCTGTGCAGCCTCTGGACGCACCGGCAGT 



GTCCACGTCGACGTCCTCAGTCCCCCTCCTAACCACGTCCGCCCCCCGAGAGACTCTGAGAGGACACGTCGGAGACCTGCGTGGCCGTCA 
QVQLQESGGGLVQAGGSLRLSCAASGRTGS 



ACG7ATGA<:ATGGG^TGGTTCCGCCAGGCTCCAG^GAAGGAGCGTGAGTCTGTAGCAGCTATTAACTGGGATAGTGCGCGCACATACTAT 

91 ___ _ + + + + + + + + ♦ 180 

TGCATACTGTACCCGACCAAGGCGGTCCGAGGTCCCTTCCTCGCACTCAGACATCGTCGATAATTGACCCTATCACGCGCGTGTATGATA 
TYDMGWFR&APGKERESVAAINWDSARTYY 
l-> CDR I <-l 1"> CDR " 

EagI 

GCAAGCTCCGTGAGGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAAGACGGTGTATCTGCAAATGAACAGCCTGAAACCTGAGGAC 

181 + 1 ♦ 4 — ♦ * * * 270 

CGTTCGAGGCACTCCCCGGCTAJWaTGGTAGAGGTCTCTGTTGCGGTTCTTCTGCCACATAGACGTTTACTTGTCGGACTTTGGACTCCTG 
ASSVRGRFTISRONAKKTVYLQMNSLKPED 

<-l 

BstEII 

ACGGCCGTTTATACCTGTGGCGCGGGGGAAGGTGGTACTTGGGACTCCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA 

271 + + + 4 ♦ ♦ ♦ 351 

TGCCGGCAAATATGGACACCGCGCCCCCTTCCACCATGAACCCTGAGGACCCCGGTCCCCTGGGTCCAGTGGCAGAGGAGT 

TAVYTCGAGEGGTWDSWGQGTQVTVSS 
l-> CDR III <"1 



Figure 2. 
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Xhol 

CTCGAGTCAGGGGGAGGATTGGTGCAGGCGGGGGGCTCTCTGAGACTCTCCTGTGCAGCCTCTGGACGCACCGGCAGTACGTA7GACATG 

i + + + + + +. * + + 90 

GAGCTCAGTCCCCCTCCTAACCACGTCCGCCCCCCGAGAGACTCTGAGAGGACACGTCGGAGACCTGCGTGGCCGTCATGCATACTGTAC 
LESGGGLVQAGGSLRLSCAASGRTGSTYDM 

l-> CDR I 

GGCTGGTTCCGCCAGGCTCCAGGGAAGGAGCGTGAGTCTGTAGCAGCTATTAACTGGGATAGTGCGCGCACATACTATGCAAGCTCCGTG 

. + + ♦ + 160 

91 + + + + * 

CCGACCAAGGCGGTCCGAGGTCCCTTCCTCGCACTCAGACATCGTCGATAATTGACCCTATCACGCGCGTGTATGATACGTTCGAGGCAC 
GWFRQAPGKERESVAAINWDSARTYYASSV 
< _ 1 l-> CDR II 

EagI 

AGGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAAGACGGTG7ATCTGCAAATGAACAGCCTGAAACCTGAGGACACGGCCGTTTAT 

. + + + +■ ♦ 270 

181 + ♦ + + + + 

TCCCCGGCTAAGTGGTAGAGGTCTCTGT7GCGGTTCTTCTGCCACATAGACGTTTACT7GTCGGACTTTGGACTCCTGTGCCGGCAAATA 
RGRFTISRDNAXKTVYLQMKSLKPEDTAVY 

<-l 

BstEII i P»tl 

ACCTGTGGCGCGGGGGAAGGTGGTACTTGGGACTCCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACAGGTGCAGCTGCAGGAGTCA 

. + + +■ + 360 

271 + * + — + + 

TGGACACCGCGCCCCCTTCCACCATGAACCCTGAGGACCCCGGTCCCCTGGGTCCAGTGGCAGAGGAGTGTCCACGTCGACGTCCTCAG7 

TCGAGEGG7WDSWGaG7QV7VSSQV^LQES 
l-> CDR III <-l 

Kpnl 

GGGGGAGGCTTGG7GCAGGCTGGGGAGTC7C7GAAAC7CTCCTGTGCAGCC7C7GGAAACACCTTCAGTGGCGGC7TCATGGGCTGG7AC 

, , + + ♦ + 450 

361 * + +— — + +— + 

CCCCCTCCGAACCACG7CCGACCCCTCAGAGAC77TGAGAGGACACG7CGGAGACC777G7GGAAG7CACCGCCGAAG7ACCCGACCATG 
GGGLVQAGESLKLSCAASGN7FSGGFMGWY 

l-> CDR I <"1 

CGCCAGGC7CCAGGGAAGCAGCGCGAG77GG7CGCAACCA77AATAGTAGAGG7ATCACAAACTA7GCAGACTTCGTGAAGGGCCGATTC 

. _ „ , + + + +- 540 

451 + * + + + 

GCGGTCCGAGG7CCC77CG7CGCGC7CAACCAGCGT7GGTAA7TATCATCTCCA7AGTGTTTGA7ACGTCTGAAGCACT7CCCGGC7AAG 
RQAPGKQRELVA7INSRGI7NYADFVKGRF 

l-> CDR II 

EagI 

ACCATC7CCAGAGACAA7GCCAAGAAGACAG7G7A777GGAAA7GAACAGCC7GGAACC7GAAGACACGGCCG777A77ACTG77ACACT 

+ + _ + + + + + * ♦ 630 

TGGTAGAGGTC7C7G77ACGG77CT7C7G7CACA7AAACC777AOT 

TISRDKAKK7VYLEMNSLEPED7AVYYCY7 

BstEII 

CAC7AC77CAGA7CC7AC7GGGG7CAGGGGACCCAGG7CACC 



631 



672 



GTGATGAAG7CTAGGA7GACCCCAG7CCCC7GGG7CCAG7GG 
HYFRSYWGQG7QV7 
l-> CDR III <-l 



Figure 4. 
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1/2 1/8 1/32 1/124 1/512 

1/4 1/16 1/64 1/256 1/1028 
dilutions 



pUR4619 pUR5330 pUR5331 pUR5332 



PUR5333 pUR5334 blanc 



Figure 5. 
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Linker 



Figure 
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Figure 7. 
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